Classical Arabic Named Entity Recognition Using Variant Deep Neural Network Architectures and BERT
نویسندگان
چکیده
Recurrent Neural Networks (RNNs) and transformers are deep learning models that have achieved remarkable success in several Natural Language Processing (NLP) tasks since they do not rely on handcrafted features nor enormous knowledge resources. Named Entity Recognition (NER) is an essential NLP task used many applications such as information retrieval, question answering, machine translation. NER aims to locate, extract, classify named entities into predefined categories person, organization location. Arabic considered a challenging because of the complexity unique characteristics Arabic. Most previous research based-Arabic focused Modern Standard Dialectal Arabic, which different variations from Classical In this paper, we investigate learning-based using neural network architectures BERT based contextual language model trained general domain text. We propose two RNN-based by fine-tunning pretrained recognize The pre-trained representations were input BGRU/BLSTM fine-tuned dataset. addition, explore variant proposed BERT-BGRU/BLSTM-CRF models. Experimentations showed BERT-BGRU-CRF outperformed other achieving F-measure 94.76% CANERCorpus. To best our knowledge, first work learning.
منابع مشابه
Named Entity Recognition in Persian Text using Deep Learning
Named entities recognition is a fundamental task in the field of natural language processing. It is also known as a subset of information extraction. The process of recognizing named entities aims at finding proper nouns in the text and classifying them into predetermined classes such as names of people, organizations, and places. In this paper, we propose a named entity recognizer which benefi...
متن کاملApplying Recurrent Neural Network to Arabic Named Entity Recognition
This technical report collects the final reports of the undergraduate Computer Science majors from the Qatar Campus of Carnegie Mellon University who elected to complete a senior research thesis in the academic year 2015–16 as part of their degree. These projects have spanned the students’ entire senior year, during which they have worked closely with their faculty advisors to plan and carry ou...
متن کاملArabic Named Entity Recognition
Stemming is the process of reducing words to their stems or roots. Due to the morphological richness and complexity of the Arabic language, stemming is an essential part of most Natural Language Processing (NLP) tasks for this language. In this paper, we study the impact of different stemming approaches on the Named Entity Recognition (NER) task for Arabic and explore the merits, limitations an...
متن کاملNeural Architectures for Named Entity Recognition
State-of-the-art named entity recognition systems rely heavily on hand-crafted features and domain-specific knowledge in order to learn effectively from the small, supervised training corpora that are available. In this paper, we introduce two new neural architectures—one based on bidirectional LSTMs and conditional random fields, and the other that constructs and labels segments using a transi...
متن کاملNeural Network Based Named Entity Recognition
Czech named entity recognition (the task of automatic identification and classification of proper names in text, such as names of people, locations and organizations) has become a well-established field since the publication of the Czech Named Entity Corpus (CNEC). This doctoral thesis presents the author’s research of named entity recognition, mainly in the Czech language. It presents work and...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2021
ISSN: ['2169-3536']
DOI: https://doi.org/10.1109/access.2021.3092261